A generalized framework for compensation of mel-filterbank outputs in feature extraction for robust ASR
نویسنده
چکیده
This paper describes a novel and efficient noise-robust frontend that utilizes a set of Mel-filterbank output compensation methods, together with cumulative distribution mapping of cepstral coefficients, for noisy speech recognition. The proposed compensation framework includes the use of noise spectral subtraction, spectral flooring and log Mel-filterbank output weighting. Recognition experiments on the Aurora II connected digit database have revealed that the proposed front-end achieves an average digit recognition accuracy of 83.46% for a model set trained from clean data. Compared with the recognition results obtained by using the ETSI standard Mel-cepstral front-end, these results represent a relative error reduction of around 58%.
منابع مشابه
A multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users
In this paper a multi-channel speech enhancement framework for distant speech acquisition in noisy and reverberant environments for Non-negative Matrix Factorization (NMF)-based Automatic Speech Recognition (ASR) is proposed. The system is evaluated for its use in an assistive vocal interface for physically impaired and speech-impaired users. The framework utilises the Spatially Pre-processed S...
متن کاملCepstrum-domain acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments
This paper presents a set of acoustic feature pre-processing techniques that are applied to improving automatic speech recognition (ASR) performance on noisy speech recognition tasks. The principal contribution of this paper is an approach for cepstrum-domain feature compensation in ASR which is motivated by techniques for decomposing speech and noise that were originally developed for noisy sp...
متن کاملAcoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments
This paper presents a set of acoustic feature pre–processing techniques that are applied to improving automatic speech recognition (ASR) performance on the Aurora 2 noisy speech recognition task. The principal contribution of this paper is an approach for cepstrum domain feature compensation in ASR which is motivated by techniques for decomposing speech and noise that were originally developed ...
متن کاملPitch-Adaptive Front-End Features for Robust Children's ASR
In the presented work, we explore some of the challenges in recognizing children’s speech on automatic speech recognition (ASR) systems developed using adults’ speech. In such mismatched ASR tasks, a severely degraded recognition performance is observed due to the gross mismatch in the acoustic attributes between those two groups of speakers. Among the various sources of mismatch, we focus on t...
متن کاملA Novel Front-end Based on Variable Frame Rate Analysis and Mel-filterbank Output Compensation for Robust ASR
For automatic speech recognition (ASR) systems, robustness in the presence of various types and levels of environmental noise remains an important issue, despite the various advances of recent years. This paper describes a new noise-robust ASR front-end employing a combination of variable frame rate processing based on the sample-by-sample delta energy parameter, Melfilterbank output compensati...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005